Identifying Outlier Arms in Multi-Armed Bandit
نویسندگان
چکیده
We study a novel problem lying at the intersection of two areas: multi-armed banditand outlier detection. Multi-armed bandit is a useful tool to model the processof incrementally collecting data for multiple objects in a decision space. Outlierdetection is a powerful method to narrow down the attention to a few objects afterthe data for them are collected. However, no one has studied how to detect outlierobjects while incrementally collecting data for them, which is necessary when datacollection is expensive. We formalize this problem as identifying outlier arms in amulti-armed bandit. We propose two sampling strategies with theoretical guarantee,and analyze their sampling efficiency. Our experimental results on both syntheticand real data show that our solution saves 70-99% of data collection cost frombaseline while having nearly perfect accuracy.
منابع مشابه
Multiple Identifications in Multi-Armed Bandits
We study the problem of identifying the top m arms in a multi-armed bandit game. Our proposed solution relies on a new algorithm based on successive rejects of the seemingly bad arms, and successive accepts of the good ones. This algorithmic contribution allows to tackle other multiple identifications settings that were previously out of reach. In particular we show that this idea of successive...
متن کاملOn Finding the Largest Mean Among Many
Sampling from distributions to find the one with the largest mean arises in a broad range of applications, and it can be mathematically modeled as a multi-armed bandit problem in which each distribution is associated with an arm. This paper studies the sample complexity of identifying the best arm (largest mean) in a multi-armed bandit problem. Motivated by large-scale applications, we are espe...
متن کاملSkyline Identification in Multi-Armed Bandits
We introduce a variant of the classical PAC multi-armed bandit problem. There is an ordered set of n arms A[1], . . . , A[n], each with some stochastic reward drawn from some unknown bounded distribution. The goal is to identify the skyline of the set A, consisting of all arms A[i] such that A[i] has larger expected reward than all lower-numbered arms A[1], . . . , A[i− 1]. We define a natural ...
متن کاملMistake Bounds on Noise-Free Multi-Armed Bandit Game
We study the {0, 1}-loss version of adaptive adversarial multi-armed bandit problems with α(≥ 1) lossless arms. For the problem, we show a tight bound K − α − Θ(1/T ) on the minimax expected number of mistakes (1-losses), where K is the number of arms and T is the number of rounds.
متن کاملPAC Bounds for Multi-armed Bandit and Markov Decision Processes
The bandit problem is revisited and considered under the PAC model. Our main contribution in this part is to show that given n arms, it suffices to pull the arms O( n 2 log 1 δ ) times to find an -optimal arm with probability of at least 1 − δ. This is in contrast to the naive bound of O( n 2 log n δ ). We derive another algorithm whose complexity depends on the specific setting of the rewards,...
متن کامل